• 041305* 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
23 October 2003 (23.10.2003) 



PCX 



(10) International Publication Number 

WO 03/088037 Al 



(51) International Patent Classification^: G06F 9/38, 9/30 (74) Agent: DUIJVESTIJN, Adrianus, J.; Intemationaal Oc- 

trooibureau B-V., Prof. Holstlaan 6, NL-5656 AA Eind- 



(21) laternational Application Number: PCT/IB03/01364 

(22) International Filing Date: 1 April 2003 (01.04.2003) 
(25) Filing Language: English 



(2<») Publication Language: 

(30) Priority Data: 
02076524.4 



English 



18 April 2002 (18.04.2002) EP 



(71) Applicant (for all designated States except US): KONIN- 
KLIJKE PHILIPS ELECTRONICS N,V. [NL/NL]; 
Groenewoudseweg 1, NL-5621 BA Eindhoven (NL). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): DE VRIES, Tromp, 
J. [NL/NL]; c/o Prof . Holstlaan 6, NL-5656 AA Eind- 
hoven (NL). BEKOOU, Marco, J., G. [NL/NL]; c/o 
Prof . Holstlaan 6. NL-5656 AA Eindhoven (NL). AU- 
GUSTEIJN, Alexander [NL/NL]; c/o Prof . Holstlaan 
6, NL-5656 AA Eindhoven (NL). VAN GAGELDONK, 
Johan, S., H. [NL/NL]; c/o Prof . Holstlaan 6, NL-5656 
AA Eindhoven (NL), 



hoven (NL). 

(81) Designated States (national): AE, AG. AL. AM, AT. AU, 
AZ, BA, BB, BG, BR, BY. BZ, CA, CH, CN, CO, CR. CU, 

CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG. KP, KR, KZ, LC, 
LK, LR, LS, LT. LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NT, NO, NZ, OM, PH, PL, PT, RO. RU, SC, SD, 
SE, SG, SK, SL, TJ, TM, TN, TR, XT, TZ, UA, UG, US, 
UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW. MZ, SD. SL, SZ. TZ. UG. ZM, ZW). 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, 
SE, SI. SK. TR), OAPI patent (BF. BJ, CF. CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE. SN, TD, TG). 

Published: 

— with international search report 

For two'letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette, 



(54) Title: VLIW PROCESSOR WITH DATA SPILLING MEANS 



17 



00 
00 



I [ 



3 



-4 









^21 





.J 



•13 



-19 
-15 



— 11 



^2 (57) Abstract: A VLIW processor comprising: a plurality of functional units (1, 3); a distributed register file (4) comprising a 
plurality of segments (5. 7, 9), the distributed register file (4) being accessible by the functional units (1, 3); a communication unit 

^5 (11) for communication with a memory; a conununication network (13) for coupling the functional units (1, 3) and the distributed 
register file (4); characterized in that the VLIW processor further comprises a spilling device (15) for transferring data between the 
distributed register file (4) and the communication unit (1 1). 
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VLIW PROCESSOR WITH DATA SPIIjIiING MEANS 



TECHNICAL FIELD 

The present invention relates to a VLIW processor comprising: a plurality of 
fimctional units; a distributed register file comprising a plurality of segments, the distributed 
register file being accessible by the fimctional units; a communication means for 
5 communication with a memor3r, a communication network for coupling the fimctional units 
and the distributed register file. 

BACKGROUND ART 

An embodiment of the VLIW processor of the kind set forth is disclosed in 

10 WO 00/33178. 

A Very Long Instruction Word (VLIW) processor allows exploiting 
instruction-level parallelism in programs and thus executing more than one instruction at a 
time. A VLIW processor uses multiple, ind^endent fimctional units to execute multiple 
operations in parallel. VLIW processors package multiple fimctional imit operations into one 

1 5 very long instmction. 

Limitations of VLIW processing include limited hardware resources, limited 
parallelism and a large increase in code size. Limited hardware resources may be the 
fimctional units, the register file or the communication network. Anticipating these 
limitations by adding more resources has some serious drawbacks. For example, when 

20 increasing the number of fimctional units, the memory size and register file bandwidth will 
have to increase as well. Furthermore, a large number of read and write ports are necessary 
for accessing the register file, imposing a bandwidth that is difficxilt to support without a 
large cost in the size of the register file and degradation in clock speed. 

For some applications to be run on the VLIW processor the limiting hardware 

25 resource may be the register file. A large register file could be used to prevent this. However, 
this has several drawbacks. First, a register file with many registers may create critical timing 
paths and therefore limit the cycle time of the processor. Second, as the number of directly 
addressable registers increases, the number of bits to specify the multiple registers within the 
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instruction increases as well. Third, a register file with many registers occupies a large silicon 
area. 

DISCLOSURE OF INVENTION 

An object of the invention is to provide a VLIW processor architecture, which 
prevents the register file being the limiting hardware resource. 

This object is achieved with a VLIW processor of the kind set forth 
characterized in that the VLIW processor further comprises spilling means for transferring 
data between the distributed register file and the communication means. 

It is to be noted that spilling itself is known, but fix>m a completely different 
field of technology, viz. firom compiler technology. The principle is that when a compiler is 
generating machine code and there are more Uve variables than the machine has registers, 
some variables are transferred or "spilled" firom registers to memory. 

The spilling means have the capability to transfer values fix>m the register file 
to the communication means, via the fimctional units. The spilling means also have the 
capability to restore values by transferring them from the communication means to the 
register file. 

An embodiment of the invention is characterized in that the spilling means 
comprise a spill register file and a spill unit, the spill register file being accessible by the spill 
unit. An advantage of this embodim^t is that it prevents that the speed of transferring values 
by the spilling means to tiie communication means limits the clock frequency of the 
processor. 

An embodiment of the invention is characterized in that at least one of the 
segments of the distributed register file is associated with a fimctional unit that is imable to 
pass data from the segment of the distributed register file to the spiUing means, a pass unit 
being associated with the functional unit for passing data from the segment of the distributed 
register file to the spilling means. In case a functional imit, such as a ROM unit, is not able 
to pass the value that has to be spilled from the register file, the pass unit is capable of doing 
so. 

An embodiment of the invention is characterized in that the communication 
network comprises a partially coimected communication network. An advantage of this 
embodiment is that it reduces the overhead of a fiiUy connected network. 
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An embodiment of the invention is characterized in that the pass unit is part of 
the associated functional unit. An advaatage of this embodiment is that no separate pass unit 
is required, saving additional siUcon area and communication coimections. 

BRIEF DESCRIPTION OF THE DRAWING 

The features of the described embodiments will be further elucidated and 
described with reference to the drawing: 

The single figure is a schematic diagram of a VLIW processor in accordance 
with an embodiment of the present invention. 



DESCRIPTION OF PREFERRED EMBODIMENTS 

Referring to the figure, a schematic block diagram illustrates a VLIW 
processor comprising a pluraUty of functional units including functional units 1 and 3. The 
processor has a distributed register file 4 including register file segments 5, 7 and 9. The 

15 functional unit 1 can read firom register file segment 5. The fimctional unit 3 can read from 
the register file segments 7 and 9. 

The processor includes a communication unit 11 for commimication with a 
memory. The latter is not shown in this figure. The processor also includes a communication 
network 13, e.g. a bus, that couples the functional units 1 and 3, and the register file segments 

20 5, 7 and 9. 

Furthermore, the processor comprises a spilling device IS. The spiUing device 
15 and the register file segments 5, 7 and 9 are coupled via the communication network 13. 
The spilling device 15 and the functional units 1 and 3 are also coupled via the 
communication network 13. The spilling device 15 allows transferring values from the 

25 register file segmrats 5, 7 and 9 to the commimication unit 11, via the functional units 1 and 
3. Furthermore, the spilling device 15 can restore values by transferring them from the 
communication imit 11 to the register file segments 5, 7 and 9. 

Preferably, the spilling device 15 comprises a spill register file 17 and a spill 
unit 19. The spill unit 19 can read from the spill register file 17. An advantage of this 

30 embodiment is that it prevents that the speed of transferring values by the spilling device 15 
to the communication unit 11 limits the clock frequency of the processor. In some 
embodiments, values can be temporarily stored in the spill register file 17, without passing 
them to the commimication unit 11. From the spill register file 17, these values can be 
transferred to tiie register file segments 5, 7 and 9 by the spill unit 19. An advantage of this 
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embodiment is that is prevents the time consuming actions of writing to and restoring values 
from the commimi cation unit 11. 

In an advantageous embodiment, the processor also has a pass unit 21. The 
pass unit 21 caa read from the register file segment 5, and the pass unit 21 and the spilling 
device 15 are coupled via the communication network 13. The pass unit 21 allows passing 
the value that has to be spilled from the register file segment 5, in case the fimctional unit 1 is 
not capable of doing so. In some embodiments multiple pass units will be present, in otiier 
embodiments none will be present. A pass unit can be associated with a fimctional unit if that 
fimctional xmit, e.g. a ROM unit, is not capable of passing a value &om a register file 
segment. 

In the embodiment shown in this figure, the pass unit 21 is a separate unit 
associated with fimctional unit 1. In some embodiments, the pass unit 21 can be part of the 
associated fimctional xmit 1 by adding an additional operation to this fimctional unit that 
allows to pass a value from a register file segment. 

The fimctional imits can be coupled to one register file segment, as in case of 
the fimctional unit 1, or to multiple register file segments, as in case of the fimctional imit 3. 
The register file segments can be coupled to one fimctional imit or to multiple fimctional 
units. The degree of coupling between the register file segtnents and the fimctional units can 
depend on the type of operations that the fimctional unit has to perform. 

The communication network 13 couples the fimctional units 1 and 3, and the 
register file segments 5, 7 and 9. The embodiment shown in this figure has a fiilly connected 
commxmication network, which means that every fimctional unit 1 and 3 is coupled to every 
register file segment 5, 7 and 9. In some embodiments the communication network 13 is 
partially connected, so not every fimctional unit 1 and 3 is coupled to every register file 
segment 5, 7 and 9. An advantage of a partially connected network compared to a fiilly 
coimected network is that it is less expensive in terms of silicon area and it allows for a 
higher clock frequency of the processor. 

The fimctional xmit 1 and 3 as well as the pass unit 21 are coupled to the 
spilling device 15. In the embodiment shown in this figure, these couplings are part of the 
commxmication network 13. In some embodiments, one or more of these couplings may be 
separate. 

The spilling device 15 is coupled to the register file segments 5, 7 and 9. In the 
embodiment shown in this figure, these couplings are part of the commxmication network 13. 
In some embodiments, one or more of these couplings may be separate. 
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It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claim. The 
5 word "comprising" does not exclude the presence of elements or steps other than those listed 
in a claim. The word "a" or "an" preceding an element does not exclude the presence of a 
plurality of such elements. In the device claim enumerating several means, several of these 
means can be embodied by one and the same item of hardware. The mere fact that certain 
measures are recited in mutually different dependent claims does not indicate that a 
1 0 combination of these measures cannot be used to advantage. 
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CLAIMS: 



1 . A VLIW processor comprising: 
a plurality of functional units; 

a distributed register file comprising a plurality of segments, the distributed 
register file being accessible by the fimctional xmits; 
5 - communication means for commimication with a memory; 

a commimication network for coupling the functional xmits and the distributed 

register file; 

characterized in that the VLIW processor further comprises spilling means for transferring 
data between the distributed register file and the conmiunication means. 

10 

2. A VLIW processor according to Claim 1 whereia: 

the spilling means comprises a spill register file and a spill xmit, the spill register file being 
accessible by the spill unit. 

15 3. A VLIW processor according to Claim 1 wh^ein: 

at least one of the segments of the distributed register file is associated with a functional unit 
that is imable to pass data fi"om the segment of the distributed register file to the spilling 
means, a pass unit being associated with the functional unit for passing data from the segment 
of the distributed register file to the spilling means. 

20 

4. A VLIW processor according to Claim 1 wherein: 

die conmiunication network comprises a partially connected commimication network. 



25 



5. A VLIW processor according to Claim 3 wherein: 

the pass unit is part of the associated functional unit. 
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